An Architectural Framework for Integrated Multiagent Planning, Reacting, and Learning
نویسنده
چکیده
Dyna is a single-agent architectural framework that integrates learning, planning, and reacting. Well known instantiations of Dyna are Dyna-AC and Dyna-Q. Here a multiagent extension of Dyna-Q is presented. This extension, called M-Dyna-Q, constitutes a novel coordination framework that bridges the gap between plan-based and reactive coordination in multiagent systems. The paper summarizes the key features of Dyna, describes M-Dyna-Q in detail, provides experimental results, and carefully discusses the benefits and limitations of this framework.
منابع مشابه
A Multiagent Variant of Dyna-Q
This paper describes a multiagent variant of Dyna-Q called M-Dyna-Q. Dyna-Q is an integrated single-agent framework for planning, reacting, and learning. Like DynaQ, M-Dyna-Q employs two key ideas: learning results can serve as a valuable input for both planning and reacting, and results of planning and reacting can serve as a valuable input to learning. M-Dyna-Q extends Dyna-Q in that planning...
متن کاملAn Integrated Human Resource Planning Framework for Project-based Organizations in Oil and Gas Industry
The complexities of the oil industry, combined project-based organizations’ complexities, have led the traditional planning of HR being failed. The success of these organizations is based on integrative human resource planning. To this end, the purpose of this study was to determine the factors and components of human resource planning in oil and gas project-based organizations and providing an...
متن کاملA Multiagent Reinforcement Learning algorithm to solve the Community Detection Problem
Community detection is a challenging optimization problem that consists of searching for communities that belong to a network under the assumption that the nodes of the same community share properties that enable the detection of new characteristics or functional relationships in the network. Although there are many algorithms developed for community detection, most of them are unsuitable when ...
متن کاملMeta-heuristic Algorithms for an Integrated Production-Distribution Planning Problem in a Multi-Objective Supply Chain
In today's globalization, an effective integration of production and distribution plans into a unified framework is crucial for attaining competitive advantage. This paper addresses an integrated multi-product and multi-time period production/distribution planning problem for a two-echelon supply chain subject to the real-world variables and constraints. It is assumed that all transportations a...
متن کاملModeling Cooperative Multiagent Problem Solving as Decentralized Decision Processes
We present a formal framework for modeling and control of cooperative multiagent problem solving. Such a framework is crucial for understanding and evaluating various design choices based on first principles. This framework allows us to define precisely the notion of optimal cooperation and analyze its complexity. Communication decisions are introduced to allow agents to reason explicitly about...
متن کامل